different sample size
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
- North America > United States > New York (0.04)
- Asia > Middle East > Jordan (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Overview (0.67)
Figure 1: Angle dynamics (left) and policy
Thank reviewers for the comments. Please find our responses below, with reference indices consistent with the paper . Q3-5: Meaning of the learned divergence? We agree that BC minimizes the policy KL divergence as what we noted in Sec. 4 (line 200). It is consistent with the literature, e.g., Sec. 2 in [Y u et al. arXiv:1909.09314].
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
- North America > United States > New York (0.04)
- Asia > Middle East > Jordan (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
- Overview (0.67)
Proofs and Additional Numerical Experiments for " Nonuniform Negative Sampling and Log Odds Correction with Rare Events Data "
Slutsky's theorem together with (S.3) and (S.5) implies the result in Theorem 1. Now we check the Lindeberg-Feller condition. 's are non-negative and E S.4 Derivation of corrected model (4) Note that π (x, 1) = 1 and π (x, 0) = π (x) . Slutsky's theorem together with (S.15) and (S.17) implies the result in Theorem 1. 's, whose distribution depends on N . From (S.27) and (S.28), Chebyshev's inequality implies that For sampled data, (5) tell us that the joint density w.r.t. the product counting measure of the responses The outline of the proof is similar to that of the proof of Theorem 2. Write Markov's inequality shows that they are both o The outline of the proof is similar to that of the proof of Theorem 4. The estimator Slutsky's theorem together with (S.38) and (S.40) implies the result in Theorem 1.
TabEBM: A Tabular Data Augmentation Method with Distinct Class-Specific Energy-Based Models
Margeloiu, Andrei, Jiang, Xiangjian, Simidjievski, Nikola, Jamnik, Mateja
Data collection is often difficult in critical fields such as medicine, physics, and chemistry. As a result, classification methods usually perform poorly with these small datasets, leading to weak predictive performance. Increasing the training set with additional synthetic data, similar to data augmentation in images, is commonly believed to improve downstream classification performance. However, current tabular generative methods that learn either the joint distribution $ p(\mathbf{x}, y) $ or the class-conditional distribution $ p(\mathbf{x} \mid y) $ often overfit on small datasets, resulting in poor-quality synthetic data, usually worsening classification performance compared to using real data alone. To solve these challenges, we introduce TabEBM, a novel class-conditional generative method using Energy-Based Models (EBMs). Unlike existing methods that use a shared model to approximate all class-conditional densities, our key innovation is to create distinct EBM generative models for each class, each modelling its class-specific data distribution individually. This approach creates robust energy landscapes, even in ambiguous class distributions. Our experiments show that TabEBM generates synthetic data with higher quality and better statistical fidelity than existing methods. When used for data augmentation, our synthetic data consistently improves the classification performance across diverse datasets of various sizes, especially small ones. Code is available at https://github.com/andreimargeloiu/TabEBM.
- Europe > United Kingdom > England > Cambridgeshire > Cambridge (0.14)
- North America > United States > New York (0.04)
- Asia > Middle East > Jordan (0.04)
- Research Report > New Finding (1.00)
- Research Report > Experimental Study (1.00)
Causal Responder Detection
Frostig, Tzviel, Machluf, Oshri, Kamber, Amitay, Berkman, Elad, Pryluk, Raviv
Personalized medicine is expected to advance healthcare in the near future [Vicente et al., 2020]. In contrast to a one-size-fits-all approach, personalized medicine advocates for treatments tailored to individual patients based on their clinical characteristics. Responder analysis in clinical trials is a method used to evaluate the effectiveness of a treatment by identifying and analyzing the subset of participants who respond significantly to the treatment [Henschke et al., 2014]. This approach contrasts with the traditional method of evaluating average effects across all participants, which can sometimes obscure the benefits seen in a particular group of responders [Guyatt et al., 1998]. This approach is particularly important in trials with heterogeneous treatment effects, where understanding individual responses can lead to more effective therapies. Responder analysis is a common practice in clinical trials [Moore et al., 2010, Straube et al., 2010, Chuang et al., 2022].
Assessing Intra-class Diversity and Quality of Synthetically Generated Images in a Biomedical and Non-biomedical Setting
Saad, Muhammad Muneeb, Rehmani, Mubashir Husain, O'Reilly, Ruairi
In biomedical image analysis, data imbalance is common across several imaging modalities. Data augmentation is one of the key solutions in addressing this limitation. Generative Adversarial Networks (GANs) are increasingly being relied upon for data augmentation tasks. Biomedical image features are sensitive to evaluating the efficacy of synthetic images. These features can have a significant impact on metric scores when evaluating synthetic images across different biomedical imaging modalities. Synthetically generated images can be evaluated by comparing the diversity and quality of real images. Multi-scale Structural Similarity Index Measure and Cosine Distance are used to evaluate intra-class diversity, while Frechet Inception Distance is used to evaluate the quality of synthetic images. Assessing these metrics for biomedical and non-biomedical imaging is important to investigate an informed strategy in evaluating the diversity and quality of synthetic images. In this work, an empirical assessment of these metrics is conducted for the Deep Convolutional GAN in a biomedical and non-biomedical setting. The diversity and quality of synthetic images are evaluated using different sample sizes. This research intends to investigate the variance in diversity and quality across biomedical and non-biomedical imaging modalities. Results demonstrate that the metrics scores for diversity and quality vary significantly across biomedical-to-biomedical and biomedical-to-non-biomedical imaging modalities.
- Europe > Ireland > Munster > County Cork > Cork (0.04)
- Asia > Middle East > Israel > Tel Aviv District > Tel Aviv (0.04)
Vehicle Dynamics Modeling for Autonomous Racing Using Gaussian Processes
Autonomous racing is increasingly becoming a proving ground for autonomous vehicle technology at the limits of its current capabilities. The most prominent examples include the F1Tenth racing series, Formula Student Driverless (FSD), Roborace, and the Indy Autonomous Challenge (IAC). Especially necessary, in high speed autonomous racing, is the knowledge of accurate racecar vehicle dynamics. The choice of the vehicle dynamics model has to be made by balancing the increasing computational demands in contrast to improved accuracy of more complex models. Recent studies have explored learning-based methods, such as Gaussian Process (GP) regression for approximating the vehicle dynamics model. However, these efforts focus on higher level constructs such as motion planning, or predictive control and lack both in realism and rigor of the GP modeling process, which is often over-simplified. This paper presents the most detailed analysis of the applicability of GP models for approximating vehicle dynamics for autonomous racing. In particular we construct dynamic, and extended kinematic models for the popular F1TENTH racing platform. We investigate the effect of kernel choices, sample sizes, racetrack layout, racing lines, and velocity profiles on the efficacy and generalizability of the learned dynamics. We conduct 400+ simulations on real F1 track layouts to provide comprehensive recommendations to the research community for training accurate GP regression for single-track vehicle dynamics of a racecar.
- Asia > China > Shanghai > Shanghai (0.06)
- North America > United States > Virginia (0.05)
- North America > United States > Pennsylvania > Allegheny County > Pittsburgh (0.04)
- (3 more...)
- Transportation (1.00)
- Automobiles & Trucks (1.00)
- Leisure & Entertainment > Sports > Motorsports > Formula One (0.89)